Apparatus, systems and methods for a thumbnail-sized scene index of media content

ABSTRACT

Scene-based program accessing systems and methods are operable to present a program at a scene corresponding to a selected thumbnail-sized image. An exemplary embodiment selects a plurality of image frames from a program based upon a scene separation duration; generates a thumbnail-sized image from each of the selected image frames, and presents the plurality of thumbnail-sized images on a scene index. The scene index is configured to present the plurality of thumbnail-sized images in a time ordered sequence corresponding to a subject matter presentation sequence of the program, and each of the selected image frames are temporally separated from each other by the scene separation duration.

PRIORITY CLAIM

This patent application is a Continuation of U.S. Non-Provisional patentapplication Ser. No. 12/903,941, filed Oct. 13, 2010, published as U.S.Publication No. 2012/0093476, and entitled “APPARATUS, SYSTEMS ANDMETHODS FOR A THUMBNAIL-SIZED SCENE INDEX OF MEDIA CONTENT,” the contentof which is herein incorporated by reference in its entirety.

BACKGROUND

Media devices, such as a set top box (STB), a stereo, a television, acomputer system, or the like, are often configured to receive streamingmedia content. Such media devices may receive media content from avariety of sources. For example, media content may be provided by acontent provider that is sourcing the media content to the media deviceover a high capacity media content delivery system, such as acable-based system, a fiber optic-based system, or a satellite-basedsystem. Another exemplary source of media content is a portable memorymedium, such as a compact disk (CD), digital video disk (DVD) or thelike. Yet another exemplary source is a web site that provides the mediacontent to the media device over a connection, such as the Internet.

As the media content is received, the media device may be configured bya user to store the received media content for later presentation. Forexample, the media device may have, or be coupled to, a digital videorecorder (DVR) or other suitable media content storage systems.Accordingly, at a later time, the user may access the stored mediacontent for presentation.

However, it may be relatively difficult for the user to appreciate thenature of the stored media content, other than by available meta datasuch as the media content title, actor identities, content descriptor,and/or one or more selected scene images. Furthermore, if the user isinterested in initiating playback of the media content at a particularscene, there is no convenient way for the user to access the mediacontent at the desired location of the scene of interest. In suchsituations, the user would initiate playback of the media content fromthe start of the media content, and then use a fast forward, skip ahead,or other playback features to advance through the media content to ascene where the user wishes to initiate or resume the playback of themedia content.

In another situation, the user may be watching media content such as amovie or a television program. At some point, the user may fall asleepor may be otherwise distracted. However, the user may not have pausedpresentation during play of the media content. In this situation, it isrelatively difficult for the user to return or rewind back to the scenewhere they stopped viewing the media content. Further, it may berelatively difficult for the user to precisely recall where they stoppedviewing the presented media content.

Accordingly, there is a need in the arts to provide a more user-friendlyway to enable a user to initiate or resume playback of selected mediacontent at a desired location in the media content.

SUMMARY

Systems and methods of accessing a program at a scene corresponding to aselected thumbnail-sized image are disclosed. An exemplary embodimentselects a plurality of image frames from a program based upon a sceneseparation duration; generates a thumbnail-sized image from each of theselected image frames, and presents the plurality of thumbnail-sizedimages on a scene index. The scene index is configured to present theplurality of thumbnail-sized images in a time ordered sequencecorresponding to a subject matter presentation sequence of the program,and each of the selected image frames are temporally separated from eachother by the scene separation duration.

BRIEF DESCRIPTION OF THE DRAWINGS

Preferred and alternative embodiments are described in detail below withreference to the following drawings:

FIG. 1 is a block diagram of an embodiment of a media content sceneindex system;

FIG. 2 conceptually illustrates portions of an exemplary media contentstream delivered to a media device;

FIG. 3 is a block diagram of an embodiment of the media content sceneindex system implemented in a set top box (STB);

FIGS. 4-8 are simplified screen shots illustrating various exemplaryalternative scene indexes generated by various embodiments of the mediacontent scene index system.

DETAILED DESCRIPTION

FIG. 1 is a block diagram of an embodiment of a media content sceneindex system 100. Embodiments of the media content scene index system100 may be implemented in a media device 102 that is configured toreceive a media content stream 104. An exemplary media device 102 mayinclude, but is not limited to, a set top box (STB), a digital videodisc (DVD) player, a digital video recorder (DVR), a game playingdevice, a mobile device (such as a tablet computer, smart phone or thelike), a personal computer (PC) or any other suitable electronic device.The media content stream 104 may comprise one or more streaming types ofmedia content. Exemplary media content streams 104 may include varioustypes of programs, such as movies, local programming, or special eventprograms.

In an exemplary media environment, the media device 102 is illustratedas an STB that is coupled to a media presentation system, such as theexemplary television (TV) 106 and a surround sound receiver 108. Theuser may watch the video portion of the presented media content on thedisplay 110 of the TV 106, and listen to the audio portion of thecurrently presented media content that is reproduced as sounds by thespeakers 112 of the surround sound receiver 108. The user may remotelyoperate the media device 102, the TV 106, and/or the surround soundreceiver 108 by actuating one or more of a plurality of controllers 114on a remote control 116. The remote control 116 emits a wireless signal118 that is received by the media device 102, the TV 106, and/or thesurround sound receiver 108. When an operating command in the wirelesssignal 118 is received, the controlled device becomes reconfigured tooperate in accordance with the received operating command. For example,the user may actuate one or more of the controllers 114 to cause themedia device 102 to retrieve and present a stored program of interestfrom a local memory medium 120 and/or remotely accessible memory medium.

Embodiments of the media content scene index system 100 are configuredto automatically select a time-ordered series of still images and/orshort duration video clips that correspond to selected scenes in thesubject matter presentation sequence of a selected program of interest.Each of the still images and/or video clips are separated temporallyfrom each other by a suitable scene separation duration. Accordingly,the time-ordered series of still images and/or video clips, eachcorresponding to a scene of the program, substantially span the durationof the selected program of interest.

The selected still images and/or video clips are processed into asmaller “thumbnail” sized still image and/or video clip. The pluralityof thumbnail-sized still images and/or video clips are presented on ascene index 122. The scene index 122 is configured to present theplurality of thumbnail-sized images and/or video clips in a time orderedsequence corresponding to the subject matter presentation sequence ofthe selected program of interest.

The exemplary scene index 122 of FIG. 1 presents fourteenthumbnail-sized still images and/or video clips (denoted as blockslabeled “A” through “N”). Each of the thumbnail-sized still imagesand/or video clips A-N correspond to a particular scene in the selectedprogram of interest. The scenes of the thumbnail-sized still imagesand/or video clips A-N are separated from each other by a suitableduration. Accordingly, the time-ordered series of presentedthumbnail-sized still images and/or video clips span a portion of, orall of, the selected program of interest. For example, the firstpresented thumbnail “A” corresponds to the start of the selected programof interest. The last presented thumbnail “N” corresponds to theconclusion of the selected program of interest.

Upon viewing the time-ordered series of thumbnail-sized still imagesand/or video clips presented on the scene index 122, the usercomprehends the general subject matter of the presented scenes of theselected program of interest. Since the thumbnail-sized still imagesand/or video clips span a substantial portion of (or all of) theduration of the selected program of interest, the user will alsoappreciate the subject matter presentation sequence of the selectedprogram of interest.

Additionally, the user viewing the presented scene index 122 comprehendsthat each of the presented series of thumbnail-sized still images and/orvideo clips are temporally separated from each other by the sceneseparation duration. In an exemplary embodiment that presents all of thethumbnail-sized still images and/or video clips generated for theselected program of interest, the user is able to estimate the time spanbetween each of the thumbnail-sized still images and/or video clipsbased on the number of presented thumbnail-sized still images and/orvideo clips in the scene index 122. Further, the user will alsoappreciate the temporal relationship of the presented subject matter ofthe selected program of interest since each of the thumbnail-sized stillimages and/or video clips presents a time-ordered series of scenes inthe selected program of interest.

For example, if the selected program of interest is a two hour longprogram, and fourteen thumbnail-sized still images and/or video clipsA-N are presented in the scene index 122 that are approximatelyseparated from each other by the same scene separation duration, theuser can estimate that the separation between each of thethumbnail-sized still images and/or video clips A-N is approximatelynine to ten minutes. (That is, a 120 minute movie with fourteenthumbnail-sized still images and/or video clips results in anapproximate nine minute duration between each presented thumbnail 124.)An exemplary embodiment may indicate the scene separation duration onthe scene index 122 so that the user is expressly informed of the sceneseparation duration between each of the presented thumbnail-sized stillimages and/or video clips.

In the exemplary embodiment of FIG. 1, the scene index 122 is presentedas a graphical user interface (GUI) that is interactive with the user.The user, via their remote control 116 that is in communication with themedia device 102, is able to interactively “scroll” or “navigate” aboutthe scene index 122 to highlight or otherwise indicate one of thethumbnail-sized still images and/or video clips presented on the sceneindex 122. When the user selects the highlighted or indicated one of thethumbnail-sized still images and/or video clips corresponding to a sceneof interest, presentation of the selected program of interest isinitiated or resumed at the scene that generally corresponds to theselected thumbnail-sized still image and/or video clip.

In practice, each of the thumbnail-sized still images and/or video clipsis associated with a corresponding playback starting point in theselected program of interest. Upon selection of one of thethumbnail-sized still images and/or video clips, embodiments identifythe location of the associated playback starting point in the selectedprogram of interest based upon the selected thumbnail-sized still imageand/or video clip. The selected program of interest is retrieved fromthe memory medium 120, or another suitable memory medium, andpresentation of the selected program of interest is initiated or resumedat the location of the playback starting point. That is, presentation isinitiated or resumed at the scene corresponding to the selected one ofthe thumbnail-sized still images and/or video clips presented in thescene index 122.

To illustrate operation of an exemplary embodiment of the media contentscene index system 100, the user may be watching a program of interest,such as a movie or the like, that has been previously stored, or that iscurrently being stored, in the memory medium 120. At some point duringpresentation of the program of interest, the user may fall asleep orotherwise become distracted so as to miss all of, or a part of, theremaining portion of the presented program. Later, after the user awakes(or at some other time), the user may wish to resume presentation of theprogram of interest at a scene that generally corresponds to when theyfell asleep (or at least at a scene where they are not able to recallthe presented subject matter of the program).

If presentation of the program of interest has concluded and the programof interest has been stored, the user accesses the memory medium 120 andselects the program of interest for presentation. In an exemplaryembodiment, upon selection of the program of interest for presentation,the media content scene index system 100 is invoked so that the sceneindex 122 is presented to the user. Alternatively, the scene index 122may be presented in response to a user request.

The user, upon viewing the thumbnail-sized still images and/or videoclips presented on the scene index 122, is able to comprehend the scenesequence of the selected program of interest. The user may then navigateto a particular one of the thumbnail-sized still images and/or videoclips where they wish to have presentation initiated or resumed. Uponselection of that particular thumbnail-sized still image and/or videoclip, the selected program of interest is retrieved from the memorymedium 120 and a corresponding playback starting point location in theselected program of interest is identified. Presentation of the selectedprogram of interest begins at the identified playback starting point.

In another situation, presentation of the program of interest may nothave yet concluded. When the user awakes or otherwise returns to theirviewing of the program of interest, the user may wish to rewind theprogram of interest to a scene that generally corresponds to when theyfell asleep (or at least at a scene where they are able to recall thepresented subject matter). When the media content scene index system 100is invoked by the user, a scene index 122 is presented that includesthose thumbnail-sized still images and/or video clips that are availableup to the extent of the currently presented scene of the program ofinterest. For example, if the currently presented program of interest isa two hour movie, and the movie is half way through its presentation(i.e., one hour), then the exemplary scene index 122 of FIG. 1 wouldpresent only those thumbnail-sized still images and/or video clips thatcorrespond to the first hour of the movie. That is, the thumbnail-sizedstill images and/or video clips A-H would be presented on the sceneindex 122. (The thumbnail-sized still images and/or video clips I-Nwould not be presented since they have not yet been generated by themedia content scene index system 100.)

To further describe operation of the exemplary scene index 122illustrated in FIG. 1, the thumbnail-sized still image and/or video clipG has been highlighted (conceptually indicated by a bold lined block G)by the user who has navigated about the scene index 122. Upon selectionof the thumbnail-sized still image and/or video clip G, presentation ofthe program begins or is resumed at the playback starting point thatgenerally corresponds to a location in the selected program of interestwhere the thumbnail-sized still image and/or video clip G was generated.Since the thumbnail-sized still image and/or video clip G is the sevenththumbnail of the fourteen thumbnails, the user appreciates thatpresentation of the program will begin or will resume at the scene thatgenerally corresponds to the midway point of the selected program ofinterest.

FIG. 2 conceptually illustrates an exemplary media content stream 104that is received by the media device 102. The media content stream 104comprises a series of programs. An exemplary program may include, but isnot limited to, a television program, a newscast, a broadcast sportsevent, a movie, or the like. The exemplary media content stream 104 isillustrated as comprising a video stream portion 202, an audio streamportion 204, and an optional meta data stream portion 206.

The conceptualized video stream portion 202 comprises a series ofserially communicated image frames 208. Each of the image framescorresponds to a scene image of the program. For example, if the motionpictures expert group (MPEG) format is used, the image frames 208 may beconfigured as a series of I-frames, P-frames and/or B-frames. The imageframes 208 may be received at the media device 102 in a digital datapackage multiplexed into the media content stream 104.

The audio stream portion 204 may also be communicated as digital data,here conceptually illustrated as discrete audio packets 210, that isprocessed and assembled into an audio stream. Presentation of the audioportion of the currently presented program is coordinated with thepresentation of the associated image frames 208.

Optionally, a meta data stream portion 206 may be included in the mediacontent stream 104. The information in the meta data stream portion 206is conceptually illustrated as discrete meta data information packets212. Information in the meta data information packets 212 includesinformation of interest, such as a channel or station identifier that isproviding the media content stream 104. Alternatively, or additionally,information in the meta data information packet 212 may also includeinformation pertaining to other characteristics of each particularprogram. For example, a first meta data information packet 212 a mayindicate, in part, the beginning of the program and/or a duration of theprogram. A last meta data information packet 212 b may indicate aconclusion of the program.

Exemplary embodiments are configured to access or otherwise retrieve theprogram of interest from the memory medium 120 (or another suitablememory medium) to determine a duration of the stored program. In anexemplary embodiment, the duration of the program of interest may bedetermined based upon information in the exemplary discrete meta datainformation packet 212 a. For example, the discrete meta datainformation packet 212 a may specify the program duration.Alternatively, or additionally, the discrete meta data informationpacket 212 a may specify a starting time of the program. The programduration may be determined if the last exemplary discrete meta datainformation packet 212 b indicates the conclusion time of the program.

The program duration may be determined in other manners. For example,the program duration may be specified in or determined from electronicprogram guide (EPG) information or the like. Or, the program durationmay be determined from the number of image frames 208 in the program andthe image frame presentation rate used during presentation of theprogram.

Since the program duration is known or determinable, the sceneseparation duration between each of the time-ordered series of stillimages and/or video clips may be determined by an exemplary embodimentof the media content scene index system 100. An exemplary embodimentdetermines the scene separation duration based upon the program durationand a desired number of thumbnail-sized still images and/or video clipsthat will be presented in the scene index 122. For example, if a desirednumber “x” of still images and/or video clips are to be presented in thescene index 122, then dividing the duration of the selected program ofinterest by the predefined number “x” determines the scene separationduration between the individual thumbnail-sized still images and/orvideo clips presented in the scene index 122.

In some embodiments, the scene separation durations are equal or aresubstantially equal. Alternatively, the scene separation durations mayvary. For example, but not limited to, a scene separation duration maybe adjusted so as not to fall onto a commercial or the like that is nota scene of the selected program of interest. Alternatively, oradditionally, the scene separation durations may be decreased when theamount of activity in that portion of the selected program of interestis relatively great, and/or may be increased when the amount of activityis relatively slow. Alternatively, or additionally, the user may specifythe scene separation duration.

In an exemplary embodiment, once the scene separation duration has beendetermined, a first image frame 208 a of the selected program ofinterest is retrieved from the memory medium 120. This first image frame208 a is at, or is near, the beginning of the selected program ofinterest. The selected first image frame 208 a is used to generate afirst thumbnail-sized still image that is presented on the scene index122. Accordingly, the generated thumbnail-sized still image correspondsto a scene image of the program. Any suitable image frame size reductionmethod may be used in the various embodiments to generate athumbnail-sized still image from the selected image frame.

The selected first image frame 208 a may be the very first image frameof the selected program of interest. Alternatively, the first imageframe 208 may occur after the start of the selected program of interestso that a more relevant scene may be presented to the user. For example,if a textual character recognition algorithm is incorporated into orused by an embodiment of the media content scene index system 100, afirst image frame 208 having text indicating the title of the selectedprogram of interest may be selected as the first image frame 208 a.

Alternatively, given the presumption that the user already knows thetitle of the selected program of interest, the first image frame 208 amay be selected based upon some other characteristic of the selectedprogram of interest. For example, but not limited to, a voicerecognition algorithm may be used to identify a scene with dialogue.Accordingly, the scene with dialogue may be selected as the first imageframe 208 a.

As another example, a face recognition algorithm and/or an objectrecognition algorithm may be used to identify a scene with actors,landscape, buildings, or other features. The scene with the recognizedfeature may be selected as the first image frame 208 a.

As yet another example, a color palate algorithm may be used to identifya scene with many colors therein, and/or to identify a transitionbetween scenes with relatively few colors to a scene with many colors.For example, image frames 208 at the beginning of the selected programof interest typically present text on a solid colored background, suchas may occur during the initial portion of the selected program ofinterest where the program title and/or opening credits are presented.Such title and opening credit scenes would typically have a relativelysmall number of different colors. In contrast, the photographed scenesof the selected program of interest will typically have many differentcolors therein. Accordingly, the scene with many colors, or where atransition from relatively few colors to many colors occurs, may beselected as the first image frame 208 a.

Additionally, or alternatively, an edge detection algorithm may be usedto identify the amount of scene activity. For example, the initial imageframes 208 presenting at the program title and/or opening creditstypically may not have a lot of edges, or may not have edges thereinthat are significantly moving from one image frame to the next. Incontrast, photographed scenes of the selected program of interest willtypically have more edges, and/or may have faster moving edges.Accordingly, the scene with more edges and/or faster moving edges may beselected as the first image frame 208 a.

Based on the determined (or predefined) scene separation duration, asecond image frame 208 b is selected. The second image frame 208 b is ator near the end of the first scene separation duration (T₁) [or is at ornear the beginning of the second scene separation duration (T₂)]. Theselected second image frame 208 b is used to generate the secondthumbnail-sized still image presented in the scene index 122. Similarly,a third image frame 208 c is selected that is at or near the end of thesecond scene separation duration (T₂) [or that is at or near thebeginning of the third scene separation duration (T₃)]. The selectedthird image frame 208 c is used to generate the third thumbnail-sizedstill image presented in the scene index 122. The process of selectingsubsequent ones of the image frames 208, based on the determined (orpredefined) scene separation duration continues until the conclusion ofthe selected program of interest is reached.

As each of the image frames 208 are selected, some embodiments areconfigured to evaluate the image frames about the start of (or end of)each scene separation duration. That is, the particular image frame 208that falls exactly on the start of (or end of) a scene separationduration may not be a scene that presents particularly relevantinformation to the user. For example, a black scene, a blurred scene, orthe like may be used to separate or facilitate transition betweendifferent scenes in the program. Such a transition scene would notimpart a significant amount of subject matter information to the userand would not be particularly desirable for generation of athumbnail-sized still image that is presented in the scene index 122.

In some situations, the selected image frame 208 may not have sufficientinformation to construct a complete scene image. For example, if themotion pictures expert group (MPEG) format is used, the image frames 208may be configured as a series of I-frames, P-frames and/or B-frames. TheP-frames and/or B-frames may not have sufficient information.Accordingly, one of the adjacent I-frames is selected as the image framefrom which a thumbnail-sized still image is generated. If a video clipis to be generated, then an I-frame will be selected as the first imageframe that is used to generate the video clip.

Since a particular selected image frame 208 may not provide a verydesirable scene image, some embodiments are configured to evaluate oneor more characteristics of a candidate image frame 208. Exemplaryembodiment may employ a text character recognition algorithm to identifytext, a voice recognition algorithm to evaluate one or more audiocharacteristics, a face recognition algorithm and/or an objectrecognition algorithm to evaluate one or more scene characteristics, acolor palate algorithm to evaluate one or more color characteristics,and/or an edge detection algorithm to evaluate one or more edgecharacteristics. Depending upon the nature of the characteristic, thatparticular image frame may be identified as an unsuitable candidate. A“best” one of the image frames may then be identified and selected asthe image frame that is used to generate the thumbnail-sized stillimage.

In some embodiments, a thumbnail-sized short video clip is generated forpresentation in the scene index 122. When a short thumbnail-sized videoclip is generated, a group of image frames 208 in proximity to the startof (or end of) each of the scene separation durations (T_(x)) areselected. The number of selected image frames 208 corresponds to aduration of the generated thumbnail-sized video clip. Thethumbnail-sized short duration video clips are then generated from theselected groups of the image frames 208.

The duration of the thumbnail-sized short duration video clips aresufficiently long so as to present enough video information to the userso that the user comprehends at least the general subject matter of thevideo clip. In an exemplary embodiment, the duration of the video clip(T_(n)) may be predefined in an exemplary embodiment. For example, butnot limited to, the duration (T_(n)) may be predefined to be tenseconds. Accordingly, a number of image frames 208 corresponding to aten second video can be determined so that when the group of imageframes 208 are selected and used to generate a correspondingthumbnail-sized video clip, the duration of the thumbnail-sized videoclip is ten seconds. In an exemplary embodiment, after conclusion of thepresentation of the thumbnail-sized short duration video clip,presentation of the thumbnail-sized short duration video clip loops backto the beginning so that the thumbnail-sized short duration video clipis continuously presented on the scene index 122 in a looped manner.

FIG. 3 is a block diagram of an embodiment of the media content sceneindex system 100 implemented in a media device 102 corresponding to aset top box (STB) that is configured to receive media content from amedia content provider. The non-limiting exemplary media device 102comprises a media content stream interface 302, a processor system 304,a memory 306, a program buffer 308, an optional digital video recorder(DVR) 310, a presentation device interface 312, a remote interface 314,and an optional external device interface 316. The memory 306 comprisesportions for storing the media content processing logic 318, the mediacontent indexing logic 320, and an optional indexed image database 322.In some embodiments, the media content processing logic 318 and/or themedia content indexing logic 320 may be integrated together, and/or maybe integrated with other logic. Other media devices may include some, ormay omit some, of the above-described media processing components.Further, additional components not described herein may be included inalternative embodiments.

The functionality of the media device 102, here a STB, is now broadlydescribed. One or more media content streams 104 are received by themedia content stream interface 302. The media content stream 104 isprovided to the media device 102 over a media content delivery system(not shown), such as, but not limited to, a satellite system, a cablesystem, or the Internet. The media content delivery system is coupled tothe media device at the media content stream interface 302.

The processor system 304, executing the content processing logic 318,and based upon a request for a program of interest specified by a user,tunes one of the tuners 324 in the media content stream interface 302 toone of the media content streams 104 that is providing the requestedprogram of interest. The media content received in the media contentstream 104 is then assembled into the video stream portion 202(comprising the image frames 208) and/or the audio stream portion 204(FIG. 2). The stream of video and audio information may then be storedby the program buffer 308 such that the media content can be streamedout to the media presentation device, such as the TV 106 and/or thesurround sound receiver 108, via the presentation device interface 312.Alternatively, or additionally, the media content may be saved into theDVR 310 for later presentation.

In an exemplary embodiment, in response to selection of a program ofinterest by the user, operation of media content scene index system 100is invoked. The processor system 304 loads and executes the mediacontent indexing logic 318 so that the stored selected program ofinterest is accessed from the DVR 310, wherein still images and/or videoclips are selected. The thumbnail-sized still images and/or video clipsare then generated based upon the selected still images and/or videoclips.

The optional indexed image database 322 is configured to store thegenerated thumbnail-sized still images and/or video clips.Alternatively, or additionally, the DVR 310 may be configured to storethe thumbnail-sized still images and/or the video clips selected from aprogram that is processed by the media content scene index system 100.

In an alternative embodiment, the selected still images and/or videoclips are saved into the indexed image data base 322 or in anothersuitable memory medium, such as the DVR 310. When the scene index 122 ispresented to the user, the saved time-ordered series of still imagesand/or video clips are then retrieved, and the thumbnail-sized stillimages and/or video clips are generated therefrom. The generatedthumbnail-sized still images and/or video clips are then presented inthe scene index 122.

In some instances, the entirety of the selected program of interest maynot have yet been received in the media content stream 104 and/or storedinto the DVR 310. That is, the remaining portion of the selected programof interest may still be in the process of being received by the mediadevice 102. In such instances, a previously received portion of theselected program of interest residing in the program buffer 308 and/orthe DVR 310 may be accessed to select still images and/or video clips.The thumbnail-sized still images and/or video clips for the receivedportion of the program of interest are then generated and presented inthe scene index 122. As additional media content is received, additionalstill images and/or video clips may be identified and selected, andcorresponding thumbnail-sized still images and/or video clips may begenerated therefrom.

In some situations, the selected program of interest may be accessedfrom an external device 326 that is communicatively coupled to the mediadevice 102, via the external device interface 316. For example, but notlimited to, the external device may be a DVD player or anotherelectronic device that is configured to access the selected program ofinterest from a persistent memory medium that is storing the selectedprogram of interest. As another example, the external device 326 may bea personal computer or other device that accesses the selected programof interest from another source, such as a web site or the like. In thissituation, the external device 326 accesses the memory media 328 onwhich the selected program of interest resides.

In some embodiments, the processing of the selected program of interestto generate the time-ordered series of thumbnail-sized still imagesand/or video clips occurs in response to specification of the selectedprogram of interest by the user. Processing the program of interest mayrequire some noticeable duration before the scene index 122 is presentedto the user. Accordingly, some embodiments are configured to continuepresentation of a currently presented program until the processing ofthe program of interest is completed. Other embodiments may switch topresentation of the selected program of interest at some predefinedpoint, such as at the beginning of the selected program of interest, atthe last viewed scene in the selected program of interest, or at anyother suitable scene. Some embodiments may present a selected stillimage of, or related to, the selected program of interest. Otherembodiments may simply present a notification image indicating to theuser that the processing of the selected program of interest isunderway.

Some embodiments may be configured to generate and save thethumbnail-sized still images and/or video clips (or alternatively,select and save the full sized still images and/or video clips fromwhich the thumbnails are generated) as selected programs are received,or as selected programs are saved into the DVR 310 or into anothermemory medium, such as the memory medium 328 of the external device 326.It is often the case that a program of interest is saved when it isreceived in the media content stream 104 at a time when the user is notable to view the program. Accordingly, when the user is initiallypresented the scene index 122, the thumbnail-sized still images and/orvideo clips (and/or their full sized still images and/or video clipsfrom which the thumbnails are generated) are already available forquicker access.

In this situation, a predefined scene separation duration is used toidentify and select the still images and/or video clips. For example,with a predefined scene separation duration of ten minutes, a sixtyminute program would have a still image and/or video clip selected everyten minutes until the conclusion of the program of interest. Here, sevenfull sized still images and/or video clips will be selected (and/orseven thumbnail-sized still images and/or video clips will be generatedand saved). If the scene index 122 is presented prior to completion ofthe processing, the plurality of thumbnail-sized still images and/orvideo clips are presented in a time ordered sequence up to a currentpresentation point of the program of interest.

Alternatively, or additionally, some embodiments may generate and savethe time-ordered series of thumbnail-sized still images and/or videoclips (and/or their full sized still images and/or video clips fromwhich the thumbnails are generated) as the program is being receivedfrom a remote source, such as a video on demand (VOD) system. Here, theembodiment would use a predefined scene separation duration.Alternatively, the scene separation duration may be defined based uponthe known duration of the program of interest that is received from theVOD system.

Some embodiments may be configured to indicate the source of the storedmedia content to the user. For example, if the source of the storedmedia content is the local DVR 310, then the user will appreciate thatupon selection of a particular thumbnail-sized still images and/or videoclips from the scene index 122, presentation will begin with little orno delay. In contrast, if the source of the stored media content is theexternal memory medium 318, then the user will appreciate that uponselection of a particular thumbnail-sized still image and/or video clip,there may be a delay in the presentation of the media content. In suchsituations, the user may make a more informed choice, such as byelecting to access the media content from another source and/or at alater time.

For example, the media content may have to be remotely accessed,processed, and then streamed to the media device 102. In some system,such as a peer-to-peer network or client-server architecture, the mediacontent may require a relatively large amount of processing time togenerate a stream of media content that is suitable for transmission tothe media device 102.

FIG. 4 is a simplified screen shot illustrating an exemplary alternativescene index 400 generated by an embodiment of the media content sceneindex system 100. The exemplary scene index 400 includes a first portion402 and a second portion 404.

This first portion 402 presents a program listing. The program listingpresents a series of programs A-E that are available for presentation,such as programs that have been locally stored on the DVR 310 (FIG. 3),or that are available from the memory media 328 or a VOD system. (In anexemplary embodiment, the displayed text may correspond to the programtitle and/or may provide additional information, such as a subjectmatter descriptor, a recording date, an original presentation date, aprogram series identifier, or the like.) The second portion 404 presentsthe time-ordered series of thumbnail-sized still images and/or videoclips associated with the currently highlighted one of the listedprograms.

The first portion 402 of the scene index 400 is configured so that theuser may initially navigate (such as by scrolling up or down) to one ofthe listed programs. When one of the programs is suitably highlighted orotherwise identified, the thumbnail-sized still images and/or videoclips associated with that particular program are presented in thesecond portion 404 of the scene index 400. To demonstrate, the “ProgramB” listed in the first portion 402 of the scene index 400 is illustratedas being the highlighted or otherwise identified program of interest, asdenoted by the bold lined box. Thus, the presented thumbnail-sized stillimages and/or video clips are understood by the user to be scenesselected from Program B.

The second portion 404 of the scene index 400 is configured so that theuser may then navigate (such as by scrolling up, down, left and/orright) to a particular one of the time-ordered series of thumbnail-sizedstill images and/or video clips. For example, the thumbnail-sized stillimage and/or video clip denoted as “E” is illustrated as beinghighlighted or otherwise identified. Accordingly, if the user selectsthe “E” thumbnail-sized still image and/or video clip, presentation ofthe associated program (here “Program B”) will begin at the scene thatcorresponds to the selected “E” thumbnail-sized still image and/or videoclip.

The exemplary scene index 400 is particularly suited for assisting auser to identify and navigate to a particular scene of interest in aseries of related program episodes. For example, but not limited to, aweekly television series may be stored, or may be otherwise accessible,at the media device 102. For example, the “Program B” might be thesecond program episode of the current season of the television serieshaving Programs A-E currently available for viewing. (If additionalepisodes are available, they may be accessed on one or more continuingpages of the scene index 400). The user, upon a review of the presentedthumbnail-sized still images and/or video clips, may realize that theyhave already viewed that highlighted episode. The user, scrolling to thenext episode, could quickly ascertain whether or not they have viewedthat episode. Accordingly, the user is able to move about and view thethumbnail-sized still images and/or video clips for a plurality ofdifferent episodes, identify an episode of interest, and then havepresentation of the selected episode of interest begin at a selectedscene of interest (associated with a particular one of thethumbnail-sized still images and/or video clips).

FIG. 5 is a simplified screen shot illustrating an exemplary alternativescene index 500 generated by an embodiment of the media content sceneindex system 100. The exemplary scene index 500 includes a first portion502 that presents a full sized, or substantially full sized, video imageof the “Program B” that is currently being presented on the display 110of the TV 106. Optionally, the audio track of the currently presentedprogram may be presented.

The exemplary scene index 500 includes a second portion 504 thatpresents a portion of the time-ordered series of thumbnail-sized stillimages and/or video clips associated with the currently presentedprogram. The exemplary second portion 504 is configured as a horizontalbanner with a single row of thumbnail-sized still images and/or videoclips that is presented below the portion 502. Alternative embodimentsmay present multiple rows of thumbnail-sized still images and/or videoclips in any suitable format. An alternative embodiment may present thehorizontal banner above the first portion 502.

Using this exemplary embodiment, the user is able to view and hear thecurrently presented program while concurrently viewing the presentedplurality of thumbnail-sized still images and/or video clips. Thecurrently presented program shown in the first portion of the sceneindex 500 may be a presented program that has been previously stored inthe DVR 310 (FIG. 3), may be a program accessed from the memory medium328 by the external device 326, or may be a program currently receivedin the media content stream 104 or a VOD system.

The thumbnail-sized still images and/or video clips A-C, and anyavailable thumbnail-sized still images and/or video clips that are pastthe thumbnail-sized still image and/or video clip H, are not shown inthe exemplary scene index 500 due to the limited size of the secondportion 504 of the scene index 500. A reverse navigation button 506 may,in an exemplary embodiment, be provided so that the user has access tothe earlier thumbnail-sized still images and/or video clips A-C. Aforward navigation button 508 may be provided so that the user hasaccess to any available thumbnail-sized still images and/or video clipsthat are past the thumbnail-sized still image and/or video clip H. Usingthis exemplary embodiment, the user may “jump back” or “jump forward” toa scene in the currently presented program by navigating about thesecond portion 504 of the scene index 500.

In some instances, the Program B may be in the process of beingcurrently received in the media content stream 104 by the media device102 (FIGS. 1 and 3). Thus, the last shown thumbnail-sized still imageand/or video clip H may correspond to a last generated, and thus a mostrecent, thumbnail-sized still image and/or video clip. In this instance,the user can navigate about the second portion 504 and select one of thethumbnail-sized still images and/or video clips A-H so that thepresentation of Program B “jumps back” to a prior presented scene ofinterest in Program B. In this exemplary instance, the still imageand/or video clip E is highlighted. Accordingly, upon selection of thethumbnail-sized still image and/or video clip E, the presentation ofProgram B will “jump back” to resume presentation at a scene in ProgramB that corresponds to the selected thumbnail-sized still image and/orvideo clip E.

FIG. 6 is a simplified screen shot illustrating an exemplary alternativescene index 600 generated by an embodiment of the media content sceneindex system 100. This exemplary scene index 600 is similar to theexemplary scene index 500 (FIG. 5). The exemplary scene index 600includes a first portion 602 that presents a full sized, orsubstantially full sized, video image of the currently presentedprogram. Optionally, the current audio track of the currently presentedprogram may be presented.

The exemplary scene index 600 also includes a second portion 604 thatpresents a portion of the time-ordered series of thumbnail-sized stillimages and/or video clips associated with the currently presentedprogram. The second portion 604 is presented as a vertically orientedbanner to the right of the first portion 602. Alternatively, the secondportion 604 may be presented as a vertically oriented banner to the leftof the first portion 602. With this exemplary embodiment, the user may“jump back” or “jump forward” to a scene in the currently presentedprogram by using the navigation buttons 606, 608 to navigate about thesecond portion 604 of the scene index 600.

FIG. 7 is a simplified screen shot illustrating an exemplary alternativescene index 700 generated by an embodiment of the media content sceneindex system 100. This exemplary scene index 700 presents the currentlyhighlighted thumbnail-sized still image and/or video clip as a largersized still image and/or video clip 702. Thus, the user may readily seegreater detail in the scene presented by the currently highlightedthumbnail-sized still image and/or video clip. Here, the thumbnail-sizedstill image and/or video clip D has been highlighted by the user whilenavigating about the scene index 700.

In an exemplary embodiment, presentation of the larger sized still imageand/or video clip 702 continues until the user initiates some otheraction. For example, the user may navigate to another one of thethumbnail-sized still images and/or video clips shown in the scene index122. In this situation, the larger sized still image and/or video clip702 would be replaced by a currently highlighted one of thethumbnail-sized still images and/or video clips of the scene index 122.If the user initiates some other action, presentation of the largersized still image and/or video clip 702 ends, and the media device 102responds to the user initiated action.

Alternatively, or additionally, presentation of the larger sized stillimage and/or video clip 702 may last for a predefined display duration.After expiration of the display duration, presentation of the largersized still image and/or video clip 702 ends. For example, when thedisplay duration is five seconds, the larger sized still image and/orvideo clip 702 is initially presented. After five seconds, presentationof the larger sized still image and/or video clip 702 ends. Someembodiments may use a display duration corresponding to the duration ofa currently presented thumbnail-sized video clip.

The exemplary larger sized still image and/or video clip 702 may be usedin conjunction with, or integrated with, other scene index embodiments.For example, the larger sized still image and/or video clip 702 may bepresented with the selected thumbnail-sized still images and/or videoclips of the scene index 122 (FIG. 1).

FIG. 8 is a simplified screen shot illustrating an exemplary alternativescene index 800 generated by an embodiment of the media content sceneindex system 100. The exemplary scene index 800 includes a first portion802 that presents a full sized, or substantially full sized, video imageof the currently presented program. Optionally, the current audio trackof the currently presented program may be presented.

The exemplary scene index 800 also includes a second portion 804 thatindicates a thumbnail-sized still image and/or video clip associatedwith the currently presented program that is ahead of, or behind, thecurrently presented scene of the presented program. The exemplary secondportion 804 is configured as a pop-up type image that may be presentedin any suitable location of display 110 of the TV 106.

In this exemplary embodiment, the user invokes the media content sceneindex system 100 to present the next occurring one of the time-orderedseries of still images and/or video clips. Alternatively, a previouslyoccurring one of the time-ordered series of still images and/or videoclips may be presented in the second portion 804.

For example, the user may be interested in jumping ahead in thepresentation of the selected program of interest. Similarly, the usermay be interested in jumping back in the presentation of the selectedprogram of interest. By viewing the next occurring one of thetime-ordered series of still images and/or video clips going forward orbackward, the user will appreciate the nature of the scene that will beforthcoming in the selected program of interest. If the user so chooses,upon selection of the presented next occurring one of the time-orderedseries of still images and/or video clips, presentation will jumpforward to the scene corresponding to that next occurring one of thetime-ordered series of still images and/or video clips.

The exemplary scene index 800 is particularly suited for assisting auser to rapidly advance forward through, or move back through, acurrently presented program of interest. For example, generation of thetime-ordered series of thumbnail-sized still images and/or video clipsmay be based upon some predefined scene separation duration, such asevery few seconds or minutes. Since the generated thumbnail-sized stillimages and/or video clips may be previously generated and stored in theindexed image database 322, they may be quickly accessed in a sequentialmanner and presented in the second portion 804 of the scene index 800.Thus, the user can skip ahead or move backward to find a scene ofinterest, and then cause the media content scene index system 100 toinitiate or resume presentation at the scene of interest identified bythe user.

The above-described embodiments present the scene indexes on apresentation device communicatively coupled to the media device 102,such as the exemplary TV 106. Alternatively, or additionally, the sceneindexes may be presented on other suitable presentation devices. Forexample, the scene indexes may be presented on a laptop computer, apersonal computer, a cellular phone, a personal device assistant (PDA)or the like.

Some embodiments may be configured to save, archive, and/or communicatea selected one of the thumbnail-sized still images and/or video clips.For example, the user may wish to e-mail one or more selectedthumbnail-sized still images and/or video clips to a friend. The friend,upon viewing the communicated thumbnail-sized still images and/or videoclips, may then better appreciate the user's experience and thus have amore meaningful discourse regarding the program. Alternatively, oradditionally, the friend may choose to initiate viewing of the mediacontent based upon the received thumbnail-sized still images and/orvideo clips which indicate an approximate location of interest in themedia content.

Some embodiments may be further configured, based on the selected sceneindex 122, to generate a short duration video clip from a selected stillimage scene index 122. The resulting generated short video clip may alsobe saved, archived, e-mailed, etc. as described above.

It should be emphasized that the above-described embodiments of themedia content scene index system 100 are merely possible examples ofimplementations of the invention. Many variations and modifications maybe made to the above-described embodiments. All such modifications andvariations are intended to be included herein within the scope of thisdisclosure and protected by the following claims.

The invention claimed is:
 1. A method for indicating program scenes of aprogram, the method comprising: selecting a series of image frames froma plurality of time sequenced image frames, wherein the selected imageframes are separated temporally from each other by a scene separationduration, wherein the selecting further comprises for each of theselected image frames: evaluating at least one characteristic of theselected image frame, wherein when the characteristic is text, theevaluating comprises identifying a scene with text, wherein theevaluated selected image, frame is unsuitable for presentation when thescene with text is identified; and wherein when the characteristic iscolor, the evaluating comprises identifying a scene with a solid coloredbackground, wherein the evaluated selected image frame is unsuitable forpresentation when the scene with the solid colored background isidentified; and selecting another image frame in proximity to theunsuitable selected image frame when the evaluation indicates that theselected image frame is unsuitable for presentation; generating aplurality of thumbnail-sized images, wherein one thumbnail-sized imageis generated from each of the suitable image frames; and presenting theplurality of thumbnail-sized images on a scene index, wherein the sceneindex is configured to present the plurality of thumbnail-sized imagesin a time ordered sequence corresponding to a subject matterpresentation sequence of the program.
 2. The method of claim 1,selecting the other image frame in proximity to the unsuitable selectedimage frame after the evaluation comprises: evaluating the at least onecharacteristic of the selected other image frame, wherein the evaluatedcharacteristic indicates that the selected other image frame is suitablefor presentation to the user when the selected other image frame permitsthe user to comprehend the scene subject matter, and wherein theevaluated characteristic indicates that the selected other image frameis unsuitable for presentation to the user when the selected image framewill not permit the user to comprehend the scene subject matter.
 3. Themethod of claim 1, wherein the selected other image frame is a firstother image frame that is unsuitable for presentation to the user, andfurther comprising: selecting a second other image frame in proximity tothe unsuitable first other image frame when the evaluation indicatesthat the first other image frame is unsuitable for presentation, whereinthe second other image frame is used to generate the correspondingthumbnail-sized image that is used in the scene index.
 4. The method ofclaim 1, wherein the scene index that is configured to present theplurality of thumbnail-sized images in the time ordered sequencecomprises a number of thumbnail-sized images that are all presentable atthe same time on a display using a plurality of rows of thumbnail-sizedimages, the method further comprising before selecting the series ofimage frames: determining a duration of the program; and determining thescene separation duration based on the number of the plurality ofthumbnail-sized images to be presented on the scene index, wherein thenumber of thumbnail-sized images presents a series of scene images thatsubstantially span the duration of the program.
 5. The method of claim1, further comprising: receiving the user selection of one of theplurality of thumbnail-sized images; identifying in the program aplayback starting point corresponding to a start of the scene associatedwith the selected thumbnail-sized still image; and retrieving theprogram from a memory medium, wherein presentation of the program beginsat the playback staring point.
 6. The method of claim 1, wherein theevaluated characteristic is a recognized face characteristic, theevaluating comprises: identifying a scene with actors when facecharacteristics are recognized, wherein the evaluated selected imageframe is unsuitable for presentation when the scene with no actors isidentified.
 7. The method of claim 6, when the recognized facecharacteristic is the evaluated characteristic, selecting another imageframe comprises: serially selecting and evaluating image frames that arein proximity to the evaluated selected image frame that was identifiedas having the scene with no actors, wherein the selecting and evaluatingcontinues until a scene with actors is identified; and selecting theimage frame having the scene with actors for generating the plurality ofthumbnail-sized images.
 8. The method of claim 1, further comprising:storing the generated plurality of thumbnail-sized images; andretrieving the stored plurality of thumbnail-sized images to present thescene index, wherein the retrieving is performed in response toreceiving the user selection of the program.
 9. The method of claim 1,further comprising: storing the selected image frames; and retrievingthe stored selected image frames to generate the plurality ofthumbnail-sized images, wherein the retrieving is performed in responseto receiving the user selection of the program.
 10. The method of claim1, wherein the program is a first program, wherein the scene index is afirst scene index associated with the first program, wherein theplurality of thumbnail-sized images are a plurality of firstthumbnail-sized images used in the first scene index, and wherein asecond program has a second scene index that includes a plurality ofsecond thumbnail-sized images that have been generated from the secondprogram, the method further comprising: presenting an electronic programguide (EPG) on a display, wherein a list of a plurality of programs thatare available for presentation is presented on a first portion of theEPG, and wherein the list of the plurality of programs includes thefirst program and the second program; receiving a user instruction thatcauses one of the first program and the second program to be highlightedon the EPG; presenting the first scene index with the plurality of firstthumbnail-sized images on a second portion of the EPG in response toreceiving the user instruction that highlights the first program; andpresenting the second scene index with the plurality of secondthumbnail-sized images on the second portion of the EPG in response toreceiving the user instruction that highlights the second program. 11.The method of claim 1, when text is the evaluated characteristic,selecting another image frame comprises: serially selecting andevaluating image frames that are in proximity to the evaluated selectedimage frame that was identified as having the scene with the text,wherein the selecting and evaluating continues until a scene with notext is identified; and selecting the image frame having the scene withno text for generating the plurality of thumbnail-sized images.
 12. Themethod of claim 1, when color is the evaluated characteristic, selectinganother image frame comprises: serially selecting and evaluating imageframes that are in proximity to the evaluated selected image frame thatwas identified as having the scene with the solid colored background,wherein the selecting and evaluating continues until a transition occursfrom the scene with the solid colored background to a scene with aplurality of different colors; and selecting the image frame having theplurality of different colors for generating the plurality ofthumbnail-sized images.
 13. The method of claim 1, wherein when theevaluated characteristic is a recognizable object, the evaluatingcomprises: identifying a scene with the recognizable object, wherein theevaluated selected image frame is unsuitable for presentation when thescene with no recognizable object is identified.
 14. The method of claim13, when the recognizable object is the evaluated characteristic,selecting another image frame comprises: serially selecting andevaluating image frames that are in proximity to the evaluated selectedimage frame that was identified as having the scene with no recognizableobject, wherein the selecting and evaluating continues until a scenewith the recognizable object is identified; and selecting the imageframe having the scene with the recognizable object for generating theplurality of thumbnail-sized images.
 15. The method of claim 1, whereinwhen the evaluated characteristic is an audio characteristic, theevaluating comprises: identifying a scene with dialogue, wherein theevaluated selected image frame is unsuitable for presentation when thescene with no dialogue is identified.
 16. The method of claim 15, whendialogue is the evaluated characteristic, selecting another image framecomprises: serially selecting and evaluating image frames that are inproximity to the evaluated selected image frame that was identified ashaving the scene with no dialogue, wherein the selecting and evaluatingcontinues until a scene with dialogue is identified; and selecting theimage frame having the scene with the dialogue for generating theplurality of thumbnail-sized images.
 17. A media device, comprising: aninterface configured to receive a program delivered in a media contentstream; and a processor system communicatively coupled to the interface,wherein the processor system is operable to: select a plurality of imageframes from the program based upon a scene separation duration, whereinselecting the plurality of image frames further comprises, for each ofthe selected image frames: evaluate at least one characteristic of eachof the selected image frames, wherein the evaluated characteristicindicates that the selected image frame is suitable for presentation toa user when the selected image frame does not present text on a solidbackground, and wherein the evaluated characteristic indicates that theselected image frame is unsuitable for presentation to the user when theselected image frame presents the text on the solid background; andselect another image frame in proximity to the unsuitable selected imageframe when the evaluation indicates that the selected image frame isunsuitable for presentation, wherein the other image frame does notpresent the text on the solid background; generate a plurality ofthumbnail-sized images, wherein one thumbnail-sized image is generatedfrom each of the suitable image frames and the selected other imageframes associated with unsuitable image frames; generate a scene indexthat is configured to present the plurality of thumbnail-sized images ina time ordered sequence corresponding to a subject matter presentationsequence of the program; and a presentation device interfacecommunicatively coupled to the processor system, and configured tocommunicatively couple the media device to a presentation device with adisplay, wherein the scene index is communicated from the presentationdevice interface to the presentation device for presentation on thedisplay.
 18. The media device of claim 17, further comprising: a remoteinterface configured to: receive a first user instruction to present anelectronic program guide (EPG) on the display, wherein the EPG presentsa list of a plurality of programs that are available for presentation ona first portion of the EPG, and wherein the list of the plurality ofprograms includes the program; and receive a second user instructionthat causes a selected one of the programs shown on the list of the EPGto be highlighted; wherein the first scene index with the plurality ofthumbnail-sized images associated with the selected program is presentedon a second portion of the EPG in response to receiving the second userinstruction.
 19. The media device of claim 18, further comprising: amemory medium configured to store the generated plurality ofthumbnail-sized images; and an external device interface configured tocommunicatively couple the media device to an external device, whereinthe external device interface is configured to receive a plurality ofother scene indexes associated with a plurality of other programs fromthe external device, and wherein the other scene indexes comprise aplurality of thumbnail-sized images, wherein the received other sceneindexes are saved into the memory medium, and wherein one of the otherscene indexes are presented on the second portion of the EPG in responseto receiving a third user instruction that highlights one of the otherprograms listed on the EPG.
 20. The media device of claim 18, furthercomprising: a memory medium configured to store a plurality of otherscene indexes associated with a plurality of other programs, and whereinthe other scene indexes comprise a plurality of thumbnail-sized images,wherein the received other scene indexes are saved into the memorymedium, and wherein when the other programs are listed on the EPG, andin response to receiving a third user instruction that highlights one ofthe other programs, the scene index associated with the highlighted oneof the other programs is presented on the second portion of the EPG.